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ABSTRACT 

The /c-truss is a type of cohesive subgraphs proposed recently for 
the study of networks. While the problem of computing most co- 
hesive subgraphs is NP-hard, there exists a polynomial time al- 
gorithm for computing /c-truss. Compared with /c-core which is 
also efficient to compute, &-truss represents the "core" of a fc-core 
that keeps the key information of, while filtering out less impor- 
tant information from, the /c-core. However, existing algorithms 
for computing /c-truss are inefficient for handling today's massive 
networks. We first improve the existing in-memory algorithm for 
computing /c-truss in networks of moderate size. Then, we propose 
two I/O-efficient algorithms to handle massive networks that can- 
not fit in main memory. Our experiments on real datasets verify the 
efficiency of our algorithms and the value of fc-truss. 

I. INTRODUCTION 

Given a graph G, the A>truss of G is the largest subgraph of G 
in which every edge is contained in at least (k — 2) triangles within 
the subgraph [15, 16]. The problem of truss decomposition in G 
is to find the (non-empty) /c-trusses of G for all k. 

The /c-truss is a type of cohesive subgraphs (or cohesive groups) 
of a network [18]. In the analysis of a massive network, it is often 
more fruitful, and more feasible, due to the large size of the net- 
work, to focus on smaller but more important areas of the network, 
i.e., subgraphs that reflect, and/or can be used to study, important 
properties of the network such as connectivity, robustness, self- 
similarity, centrality, etc. Thus, network analysts have attempted 
to identify various cohesive subgraphs for more efficient and effec- 
tive analysis of a network. 

In the literature, many notions of cohesive subgraphs were pro- 
posed. The basic one are the cliques (i.e., a subset of vertices that 
forms a complete subgraph) [21] and maximal cliques [7]. How- 
ever, the definition of clique is often too rigid and thus other more 
relaxed forms of cohesive subgraphs were proposed. The n-clique 
[22] relaxes the distance between any two vertices in a clique from 
1 to n. The k-plex [29] relaxes the degree of each vertex within a 
clique of c vertices from (c — 1) to (c — k). The n-clan [24] is 
the same as the n-clique except for imposing a constraint on the 
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diameter, while the n-club [24] removes the n-clique requirement 
from the n-clan. The quasi-clique can be either a relaxation on the 
density [1] or the degree [23, 26]. However, the computation of all 
the above cohesive subgraphs is NP-hard. 

All the above-mentioned cohesive subgraphs are relatively small 
sub- structures in a graph. They may be scattered all over the graph, 
and some of them may overlap largely with each other, while others 
are disconnected from each other. For example, two cliques may 
share all but one vertex with each other, or may be totally isolated 
from each other. 

On the contrary, /c-trusses are hierarchical subgraphs that repre- 
sent the cores of a network at different levels of granularity. In this 
sense, /c-truss is more similar to k-core [28], which is the largest 
subgraph of a graph in which every vertex has degree at least k 
within the subgraph. However, the /c-core was described by Sei- 
dman [28] as a "seedbed" within which cohesive subgraphs may 
precipitate (e.g. a /c-truss is a (k — l)-core but not vice versa) [15]. 
Thus, though containing other cohesive subgraphs such as cliques 
and /c-truss, the /c-core may also contain a lot more that can be fur- 
ther filtered out for more effective and efficient network analysis. 

Conceptually, the definition of /c-truss is also more rigorous than 
that of /c-core since /c-truss is defined based on triangles, which are 
known as fundamental building blocks of a network [32, 33, 25, 
6]. In a social network, a triangle implies a strong tie among three 
friends, or two friends having a common friend. Thus, by enforcing 
all edges to be contained in at least (k — 2) triangles, the /c-truss 
strengthens every (edge) connection in it by at least (k — 2) strong 
ties. Intuitively, we may consider this as in a social network, the 
more common friends two people have, the stronger their connec- 
tion it implies. On the contrary, in a /c-core we only have simple 
edge connection (i.e., degree). 

We give an example to illustrate the difference between /c-truss 
and /c-core as follows. 



Example 1. Figure 1(a) shows a graph, G, that displays the "seek- 
advice-from" relationship among the managers of a high-tech man- 
ufacturing firm in the west coast of the U.S. [19, 32, 15]. Figure 
1(b) and 1(c) show the 3-core and 4-truss of G, which are also 
given as an example in [15]. Note that no 4-core or 5-truss exist for 
G, i.e., there is no subgraph of G that has all vertices with degree 
at least 4 or all edges contained in at least (5 — 2) = 3 triangles 
within the subgraph. 

The figures show that the 3-core is not much different from the 
original graph G, while the 4-truss is significantly different from G. 
The clustering coefficient [33] of G, of the 3-core, and of the 4-truss 
is calculated to be 0.51, 0.65, and 0.80, respectively, indicating that 
the 4-truss tends to form clusters or communities at a degree much 
higher than G and the 3-core. 
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Figure 1: (a) A manager- relationship graph, G; (b) the 3-core 
of G (no 4-core exists); (c) the 4-truss of G (no 5-truss exists) 

The 4-truss also satisfies the requirement of a 3-core by defini- 
tion (not vice versa); however, the 4-truss further filters out those 
vertices with lower local clustering coefficient in the 3-core, i.e., 
those vertices that do not tend to form clusters or tightly-knit com- 
munity structures with others. 

From another angle, we may also see that the 4-truss contains all 
the cliques with more than 3 vertices, namely {4,8,10,18}, {4,8,18,21}, 
{5,10,18,19}, {7,14,18,21}, and {10,15,18,19}, but it filters out all 
the other less cohesive sub- structures that exist in the 3-core. 

This example also shows that, as the "core" part of the /c-core, 
the /c-truss has a smaller size and a clearer display of the essential 
part of a network. Thus, /c-truss can be more suitable than /c-core in 
applications such as visualization and fingerprinting of large-scale 
networks [3], interpretation of cooperative processes in complex 
networks [8], and analysis of network connectivity [4], etc. □ 

On the computational aspect, the non-hierarchical cohesive sub- 
graphs are mainly evolved from the cliques and therefore expensive 
to compute. On the contrary, computing the hierarchical cohesive 
subgraphs, /c-core and /c-truss, has a polynomial time complexity. 

Cohen [15] proposed an algorithm for truss decomposition, which 
requires random access to vertices/edges and hence the entire input 
graph to be resident in main memory. However, real world net- 
works have grown drastically in recent years. It is unrealistic to 
assume that these graphs/networks can always fit in main memory. 

Recently, Cohen [16] also proposed a parallel algorithm based 
on the MapReduce framework which does not require keeping the 
entire input graph in any single machine. To compute the k -truss, 
the algorithm iteratively invokes a MapReduce procedure for trian- 
gle listing whenever there are some edges that are contained in less 
than (k — 2) triangles. The iterative counting of triangles enforced 
in the definition of /c-truss requires many iterations of a main pro- 
cedure that makes parallelization of the entire process difficult. 

In this paper, we first propose an efficient in-memory algorithm 
for truss decomposition that has the same worst-case complexity 
as the lower-bound complexity of in-memory triangle listing [30]. 
We also show that our algorithm is significantly more efficient than 
the existing algorithms for truss decomposition [15, 16] in small 
networks. However, for processing massive networks that cannot 
fit in main memory, the existing algorithms become impractical due 
to huge I/O cost. 

We develop two I/O-efficient algorithms for truss decomposition 
in massive networks that cannot fit in memory. The first one is a 
bottom-up approach that employs an effective pruning strategy by 



removing a large portion of edges before the computation of each 
/c-truss, thus significantly reducing both disk I/O cost and search 
space during truss decomposition. The second one takes a top- 
down approach, which is tailor-made for applications that prefer 
the /c-trusses of larger values of k, as they often represent the heart 
or backbone of a network. 

We evaluate our algorithms on a range of real world networks. 
For networks of moderate sizes that can reside in memory, the re- 
sults show that our in-memory algorithm significantly improves the 
existing in-memory algorithm [15]. For larger networks that can- 
not fit in memory, the results show that our I/O-efficient method is 
much more efficient than the existing MapReduce algorithm [16]. 

Organization. Section 2 formally defines the problem and gives 
the basic notations. Section 3 describes the in-memory algorithms 
and identifies their limitations. Section 4 gives an overview of the 
two I/O-efficient algorithms and highlights the challenges. Sections 
5 and 6 discuss in details the bottom-up and top-down approaches. 
Section 7 reports the experimental results. Section 8 discusses the 
related work and Section 9 concludes the paper. 

2. PROBLEM DEFINITION 

We consider undirected, unweighted simple graphs. Given a 
graph C, we denote by Vg and Eg the vertex set and edge set 
of G, respectively. We define n = \Vg\ as the number of vertices, 
and m = \Eg | as the number of edges of G. We define the size of 
G, denoted by \G\, as \G\ = m + n. We use nb(v) to denote the 
set of neighbors of a vertex v, that is, nb(v) — {u : v) £ Eg}- 
We define the degree of v in G as deg(v) = \nb(v)\. 

Unless otherwise specified, we assume that a graph is stored in 
its adjacency list representation (whether in memory or on disk). 
Each vertex in the graph is assigned a unique ID. In the adjacency 
list representation, vertices are sorted in ascending order of their 
IDs. Given any two vertices u and v, we use u < v or v > u to 
denote that u is ordered before v. 

A triangle in G is a cycle of length 3. Let u,v,w £ Vg be the 
three vertices on the cycle, we denote this triangle by A uvw . In 
addition, we denote the set of triangles of G by Ac. On the basis 
of triangles, we define the support of an edge [15] as follows. 

Definition 1 (SUPPORT). The support of an edge e = (u,v) £ 
Eg in G, denoted by sup(e, G), is defined as \{A UVW : A uvw £ 
Ag}|. When G is obvious from content, we replace sup(e, G) by 
sup(e). 

The support of an edge e in G is simply the number of triangles 
in G that contain e. Now we define the notion of /c-truss [15, 16]. 

Definition 2 (/c-TRUSS). The /c-truss of G, where k > 2, de- 
noted by Tk, is defined as the largest subgraph of G, such that 
VeeE Tk , sup(e,T k ) > (k - 2). 

By definition, the 2-truss is simply G itself. 

We define the truss number, or trussness, of an edge e in G, 
denoted by 0(e), as max{/c : e £ Er k }- It follows that given 
0(e) = k, we have e £ Er k but e ^ ET k+1 . We use k m ax to 
denote the maximum truss number of any edge in G. From the 
truss number comes our definition of the /c-class. 

Definition 3 (/c-CLASS). The k-class of G, denoted by &k, is 
defined as {e : e £ Eg, 0(e) = k}. 

Problem definition. The problem of truss decomposition stud- 
ied in this paper is defined as follows. Given a graph G, compute 
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Table 1: Frequently Used Notations 



TV! f\ t o 1 1 rvn 


l^caLI ipilUIl 


G = {V G ,E G ) 


A undirected, unweighted simple graph G 


n\ m 


The number of vertices/edges in G 


\G\ 


The size of G, \G\ = m + n 


nb(v) 


The set of neighbors ofv£ Vq 


deg(v) 


The degree of vertex v G Vq 


A uvw 


A triangle formed by u, v and w 


sup(e) 


The support of e in G 


sup(e, H) 


The support of e in a subgraph H of G 


T k 


The fc-truss of G 


0(e) 


The truss number of e in Cr 


4>(e,H) 


The truss number of e in a subgraph H of G 


$k 


The fc-class ofG,$ fe = {e:eG E G ,4>(e) = k} 


cp(e) 


The estimated lower bound of 0(e) for e £ 




The estimated upper bound of 0(e) for e G Eq 


M 


The size of available main memory 


B 


The disk block size 


scan(N) 


S(N/B) 



the k-truss of G for all 2 < k < kmax- Equivalently, the k-truss 
can be obtained from the set of edges Er k = U j>k ®j> i-e-> the 
union of all edges with truss number at least k. When the input 
graph G cannot fit in memory, we propose I/O-efftcient algorithms 
to compute the k-classes. 

The following example illustrates the concept of /c-truss. 




5-class edge 



4-class edge 



3-class edge 



2-class edge 



Figure 2: A graph G and the /c-classes of G (2 < k < 5) 

Example 2. In the graph G in Figure 2, the different types of 
edges indicate the different /c-classes, where 2 < k < 5. In G, 
the 2-class $2 has a single edge (i, k), i.e., $2 = {(^, k)}, since 
(i, k) is the only edge in G with support 0. The 3-class $3 con- 
sists of 9 edges, given by $ 3 = {(d,#), (d, k), (d,l), (e,/), (e,g), 
(/) 9)i h)i (<7> k), (g, I)}. The 4-class <E>4 contains 6 edges, given 
by $4 ={(/,/0,(/,i),(/,j)> (h,i),(h,j),(i,j)}. The 5-class 
$5 consists of 10 edges, given by $5 = { (a, &) , (a, c) , (a, d) , (a, e) , 
(6, c), (6, d), (b, e), (c, d), (c, e), (d, e)}. We have /c macc = 5. 

From the /c-classes we obtain the /c-trusses as follows. The 2- 
truss T2 is simply G itself. The 3-truss T3 is the subgraph of G 
formed by the edge set ($3 U $4 U $5). The 4-truss T 4 is the 
subgraph formed by ($4 U $5), and the 5-truss T5 is the subgraph 
formed by $5. We can verify that for 2 < k < 5, each edge of Tk 
is contained in at least k — 2 triangles in The /c-trusses display 
the hierarchical structures of G at different levels of granularity, as 
depicted by the shading of different gray scales in Figure 2. □ 

Table 1 lists the notations that are frequently used in the paper. 
When analyzing I/O complexity of our algorithms, we adopt the I/O 
model in [2] : M is the main memory size and B is the disk block 
size, where 1< 5 < M/2. Data is read/written in blocks from/to 
disk. Thus, reading/writing a piece of data of size N from/to disk 
requires (N/B) I/Os. We also define scan(N) = B(§), where 
N is the amount of data being read or written from/to disk. 



Algorithm 1 Truss Decomposition 

Input: G= (V G ,E G ) 

Output: the /c-truss for 3 < k < kmax 

1. k <- 3; 

2. for each e = (u, v) G Eq do 

3. sup(e) = \nb(u) P\nb(v)\; 

4. while(3e = (it, v) such that sup(e) < (k — 2)) 

5. W 4- (nb(u) nnb(v)); 

6. for each e 1 = (u, w) or e' = (v,w), where w £ W, do 

7. sup(e') <— (sup(e') — 1); 

8. remove e from G; 

9. output G as the fc-truss; 

10. if '(not all edges in G are removed) 

11. fc<-(fc + l); 

12. goto Step 4; 



3. IN-MEMORY TRUSS DECOMPOSITION 

We first describe an existing algorithm for truss decomposition. 
Then, we propose an improved algorithm and prove that its time 
complexity is bounded by that required for triangle listing. Finally, 
we identify the limitation of the in-memory algorithms. 

3.1 The Existing In-Memory Algorithm 

We outline the algorithm of [15] for truss decomposition in Al- 
gorithm 1 . The algorithm starts with an initialization by computing 
the support of every edge in G (Steps 2-3). The intersection of 
nb(u) and nb(v) for each edge e = (u, v) returns the set of ver- 
tices that form triangles with u and v, and thus the cardinality of 
the intersection gives the support of e. 

After the initialization, for each k starting from k = 3, the algo- 
rithm removes every edge e — (u,v) with support less than (k — 2), 
since e cannot be in the /c-truss by definition (Steps 4-8). Remov- 
ing e, however, may also invalidate all triangles consisting of e, i.e., 
\/A uvw , where w G W = (nb(u) D nb(v)), A uvw is no longer a 
valid triangle after the removal of e = (u, v). Thus, we also decre- 
ment the support of the other two edges (u, w) and (v, w) for each 
Auvw, where w G W. This process is repeated iteratively until all 
the remaining edges in G have support at least (k — 2), which is 
the /c-truss. 

If there are still some edges in G not yet removed, we continue 
with the next k by repeating the above process, i.e., Steps 4-9. 

In Step 4 of Algorithm 1, we need to find edges with support 
less than (k — 2). This step can be efficiently processed by using 
a queue. Whenever we compute (during initialization) or update 
(upon the removal of an edge) the support of an edge, we push the 
edge into the queue if its support becomes less than {k — 2) or 
update its position in the queue if the edge is already in the queue. 

In Step 8 of Algorithm 1, we remove edges from G. Explic- 
itly deleting edges from G during every step of the process can 
be expensive since it involves updating the adjacency lists of u 
and v for each edge e = (u,v), which requires time linear in 
(deg(u) + deg(v)). Thus, an implicit approach by simply marking 
that e has been deleted in nb(u) and nb(v) is more efficient. 

Complexity. Algorithm 1 requires 0(m + n) memory space to 
keep the input graph as well as the support of all edges in mem- 
ory. The initialization (Steps 2-3) can be made faster using the 
in-memory triangle counting algorithm [27, 20]. However, Step 5 
still requires 0(deg(u) + deg(v)) time for each edge e = (u, v) 
processed, thus giving a total of 0(^2^ u v )^e g (deg(u) + deg(v)) 
= 0(%2 veV (deg(v)) 2 ) time. This can be expensive for large 
graphs with vertices of high degree. 
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Algorithm 2 Improved Truss Decomposition 

Input: G= (V G ,E G ) 

Output: the /c-class, 6^, for 2 < k < kmax 

1. k <- 2, <S> k «- 0; 

2. compute sup(e) for each edge e G £?g; 

3. sort all the edges in ascending order of their support; 

4. while(3e such that sup(e) < (k — 2)) 

5. let e = (u, v) be the edge with the lowest support; 

6. assume, w.l.o.g., deg(u) < deg(v); 

7. for each w G nb(u) do 

8. if((u,w) G #g) 

9. sup((u,w)) ^— (sitp((ix, — 1), 
sitp((?;, tu)) ^— (swp(( , u, it;)) — 1); 

10. reorder (u, w) and (v, w) according to 
their new support; 

11. <- ($ fc U {e}); 

12. remove e from G; 

13. if (nor all edges in G are removed) 

14. fc<_(fc + l); 

15. goto Step 4; 

16. return <E>j, for 2 < j < k; 



3.2 An Improved Algorithm 

We now present an improved algorithm for truss decomposition, 
as shown in Algorithm 2. 

The algorithm also starts from an initialization that computes the 
support of each edge in G. We then sort all the edges in ascending 
order of their support. The computation of the support of the edges 
can be done in 0(m 1-5 ) time by the in-memory triangle counting 
algorithm [27, 20]. The sorting can be done in 0(m) time with 
0(m) space using bin sort. The sorted edges are then kept in an 
array A, in a way similar to the sorted degree array in [5]. 

After the initialization, for each k starting from k = 2, the al- 
gorithm iteratively removes a lowest support edge e = (u, v), i.e., 
the first edge in the sorted edge array A, while the support of e is 
not greater than k — 2. The removed edge e is added to since 
sup(e) < (k — 2) and thus e cannot be in <3>fc+i . Upon the removal 
of e, we also decrement the support of all other edges that form a 
triangle with e, and update their new positions in the sorted edge 
array A. The membership test of whether (v,w) G Eg at Step 8 
can be done efficiently by keeping Eg in a hashtable. The location 
of each edge in A is stored in the hashtable. Each update can be 
done in A in constant time in a way similar to that in the sorted de- 
gree array in [5]. We do not explicitly remove e from G, but simply 
move a pointer one position forward in A to point to the next edge 
with the lowest support (i.e., all edges to the left of the pointer have 
been removed). This process continues until all edges with support 
less than or equal to (k — 2) are removed. 

In this way, we compute each /c-class until all edges in G are 
removed. 

Algorithm 2 is similar to Algorithm 1 but there is one subtle 
difference between Steps 5-6 of Algorithm 1 and Steps 6-8 of Al- 
gorithm 2 in the search for A UV w. This difference significantly 
reduces the time complexity of Algorithm 2 as shown in Theorem 
1 . We also show by experiments that Algorithm 2 is indeed signifi- 
cantly faster than Algorithm 1 . 

The correctness of Algorithm 2 is apparent since the algorithm 
essentially computes the fc-truss by its definition. We show the 
complexity of this algorithm as follows. 

Theorem 1 . Algorithm 2 computes the k-truss, for all k > 3, 
in 0(m 15 ) time using 0(m + n) space. 

PROOF. The support computation at Step 2 uses 0(m 1-5 ) time 



and 0(m + n) space [27, 20], while Step 3 uses 0(m) time with 
0(m) space using bin sort. 

In the main iterative loop (i.e., Steps 4-12), Step 8 can be done 
in expected constant time by hashing and all other individual oper- 
ations at other steps (except Step 7) can done in constant time (see 
the details in the algorithm description above). Thus, the total time 
depends on how many times these individual operations are exe- 
cuted. Steps 11-12 are obviously executed 0(m) times. Therefore, 
we only need to analyze how many times the individual operations 
at Steps 8-10 are executed. 

For each edge e = (u,v) removed, where deg(u) < deg(v), as 
indicated by Step 6, the operations at Steps 8-10 are executed for 
at most deg(u) times. Let nb>(u) be {v : v G nb(u), deg(v) > 
deg(u)}. Thus, the loop at Steps 7-10 is executed for \nb>(u)\ 
times for the vertex u, and the operations at Steps 8-10 are executed 
for at most (deg(u) • \nb>(u)\) times. 

We prove that for any u G Vb, \nb>(u)\ < 2y / m. If deg(u) < 
^Jm, then trivially \nb>(u)\ < deg(u) < 2y / m. If deg(u) > 
\fm, we prove by contradiction. Suppose \nb>(u)\ > 2y / m. 
Then 'E.enfe>(^) deg{y) > (\nb>(u)\-deg(u)) > (2^V™)= 
2m, contradicting the fact that J2 v ev G deg(v) = 2m. Thus, the 
total time is Y, u ev G ( de 9( u ) ' \ nb >( u )\) < T, u ev G ( de 9( u ) ' 
2y / m) — (m • 2y / m) — 0(m 1-5 ). Summing up, the total time 
complexity is 0(m 1-5 ). 

Algorithm 2 uses 0(m + n) space to hold the input graph, 0(m) 
space for the hashtable for edge membership test at Step 8, and 
0(m) space for the edge arrays for the bin sort. Thus, the space 
complexity is 0(m + n). □ 

Note that the complexity, both time and space, of Algorithm 2 
in the worst case is the same as the lower-bound complexity for 
triangle listing [30]. 

3.3 Limitations of In-Memory Algorithms 

Both Algorithm 2 and Algorithm 1 are in-memory algorithms 
that require 0(m + n) memory space. The constant factor is small 
as we only need extra space for the queue or sorted edge array, the 
hashtable or for marking whether an edge is deleted, in addition to 
the space used to hold the input graph. However, for processing 
a large graph in practice, any small multiplicative factor results in 
huge amount of extra memory space needed. With the drastically 
increased volume of graphs/networks in recent years, it is unrealis- 
tic to assume that the input graph can always fit in memory. 

When the input graph cannot fit in memory, Algorithm 2 and Al- 
gorithm 1 reveal that random access to vertices and edges stored on 
disk is necessary, which can incur prohibitively high I/O cost. The 
effect of locating an edge to be removed may trigger the removal 
of other edges and this propagating effect can spread to random lo- 
cations in the graph. Moreover, the removal of an edge may lead to 
multiple iterations of support downgrading. These may become se- 
rious bottlenecks when the relevant parts of the graph do not reside 
in memory. Therefore, identifying distinct iterative steps and the 
relevant subgraphs for such iterations can be a key factor in reduc- 
ing the disk I/O cost for truss decomposition. How to design I/O 
efficient algorithms is the main focus of the subsequent sections. 

4. I/O-EFFICIENT DECOMPOSITION 

In Sections 5 and 6, we will present two different algorithms that 
aim at reducing the I/O cost for truss decomposition in large graphs 
that cannot fit in memory. We briefly discuss the main objectives 
of the two algorithms and their differences in this section. 
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• Bottom-up approach. The bottom-up approach starts from 
the smallest k, i.e., k = 2. The algorithm determines a lower 
bound on the truss number of the edges in G, extracts a can- 
didate subgraph that contains all edges in the /c-class, 
computes in memory, removes all edges in from the 
input graph C, and then moves on to compute <3>fc+i. This 
process repeats iteratively until all edges in G are removed. 

• Top-down approach. The top-down approach starts from 
largest possible k. The algorithm determines an upper bound 
on the truss number of the edges in G, extracts a candidate 
subgraph that contains all edges in computes in mem- 
ory, removes all irrelevant edges from G, and then moves on 
to compute &k-i- 

The bottom-up approach extracts a smaller candidate subgraph 
than the top-down approach in most cases and is also more effective 
in removing irrelevant edges for the computation of the subsequent 
A;-classes. The top-down approach, though less efficient when all 
the /c-classes need to be computed, is particularly suitable for appli- 
cations that demand only the top /c-trusses, i.e., the /c-trusses with 
the largest values of k, which is reasonable since those top /c-trusses 
are the more important and core part of a graph or network. 

The main idea of both bottom-up and top-down approaches is 
simple but there are a number of technical challenges: (1) all the 
steps (e.g., lower/upper bound estimation, candidate subgraph ex- 
traction) need to be made I/O-efficient, and it is not straightforward 
to avoid random access in these steps; (2) many steps in our algo- 
rithms apply local computation of global result, ensuring the cor- 
rectness of the /c-class globally is a challenge; and (3) pruning is 
essential for truss decomposition in large graphs but effective and 
correct pruning is difficult, especially for the top-down approach, 
due to the tight inter-connections among edges via triangles. We 
address all these issues in our algorithms. 

5. BOTTOM-UP TRUSS DECOMPOSITION 

In this section, we discuss in details the bottom-up approach for 
truss decomposition. We first outline the framework of the algo- 
rithm, which consists of the following two main stages. 

Lower-bounding: This stage forms the basis for the later stage of 
truss decomposition. We compute a nontrivial lower bound 
on the truss number of each edge. We also remove the 2- 
class, $2, which can be directly obtained from lower-bounding, 
to reduce the input size to the subsequent computation. 

Bottom-up truss decomposition: This stage computes the k- 
classes iteratively bottom-up, i.e., start from the 3-class to 
the k m ax -class. We use the lower bound to extract a small 
candidate subgraph, which can be loaded in main memory 
in most cases to avoid random disk access. After comput- 
ing each fc-class, we remove all the edges in the A;-class, thus 
reducing the costs for computing the remaining /c-classes. 

5.1 Lower Bounding 

We first define the concept of neighborhood subgraph of a set of 
vertices, which is frequently used in our algorithms. 

Definition A (NEIGHBORHOOD SUBGRAPH). Let U C V G . 
The neighborhood subgraph of U, denoted by NS(U), is a sub- 
graph of G which is defined as NS(U) = (V N s(u), E NS (u))> 
where V N s(u) = U U {v : v G nb(u),u G [/} and E NS (u) — 
{(u,v) : (u,v) eE G ,ue U}. 



Algorithm 3 LowerBounding 
Input: G=(V G ,E G ) 

Output: $2, and a new graph G ne w, each edge e in G new is asso- 
ciated with a lower-bound on the truss number, ip(e) 

1. Ve G E G : (p(e) <- 0; 

2. whi\e(not all edges in G are removed) 

3. partition V G into V = {Pi , P 2 , • • • , P P }, 

s.t. each Pi G V fits in memory; 

4. for each Pi £ V do 

5. let H be the neighborhood subgraph NS(Pi) of Pi; 

6. compute <j>(e, H) for each edge e in H by Algorithm 2; 

7. (p(e) <— max{ip(e), 0(e, H)} for each edge e in H; 

8. <E>2 ^— {internal edge e of if : sup(e) = 0}; 

9. output <E>2 as part of $2, 

and remove <E> 2 from H and G; 
10. output each remaining internal edge e of H, with (p(e), 

as part of G n ew , and remove e from G; 



Intuitively, the neighborhood subgraph of U is a subgraph ob- 
tained by adding to G[U] those edges from the vertices in U to 
those neighboring vertices of U in V G \U, where G[U] is the in- 
duced subgraph of G by U. We refer to U and Eq[u] as me inter- 
nal vertices and internal edges of NS(U), respectively, while the 
remaining vertices and edges in NS(U) are called external vertices 
and external edges. 

We now outline the algorithm for lower-bounding the truss num- 
ber of the edges in Algorithm 3. The algorithm iterates over Steps 
2-10, computes the lower-bound on the truss numbers for a portion 
of edges at each iteration, and removes these edges, until all edges 
in the input graph G are removed. 

At each iteration, we partition the vertex set of the current graph 
G (note that G is shrinking at each iteration) into p > 2\G\/M 
parts (Step 3). Chu and Cheng proposed three linear- time parti- 
tioning algorithms for triangle listing in large graphs that cannot 
fit in memory [13]. The first one sequentially partitions G, which 
is fast but does not have a theoretical guarantee on the number of 
iterations. The second one uses a dominating vertex set of G as 
seeds to guide the partitioning, which uses 0(n) memory space but 
the number of iterations can be bounded by 0(m/M). The third 
one is a randomized partitioning algorithm that removes the space 
requirement of the second algorithm and still bounds the number 
of iterations by 0(m/M) with high probability. Since their algo- 
rithms partition G into p approximately equal- sized neighborhood 
subgraphs, where each subgraph fits in main memory, we can apply 
any of them in our algorithm. 

For each of the p neighborhood subgraphs, H, we compute the 
truss number 0(e, H) for each edge e locally in H. Since H fits in 
main memory, we apply the in-memory truss decomposition algo- 
rithm, Algorithm 2. Then, we assign 0(e, H) as the lower bound 
ip(e) of the truss number of e globally as in G. The following 
lemma establishes the relationship between the local truss number 
0(e, H) in H and the global lower bound ip(e) in G. 

Lemma 1. Given a graph G and any neighborhood subgraph 
H ofG, we have 0(e) > <f>(e, H). 

PROOF. The proof follows directly from the fact that if is a 
subgraph of G. □ 

Lemma 1 implies that the maximum 0(e, H) of any neighbor- 
hood subgraph H (at any iteration) can be used as a lower bound 
of the truss number of an edge e in G, as done in Steps 6-7. 

In the process of computing <j>(e,H) by Algorithm 2, we can 
also obtain sup(e) for each internal edge e = (u, v) in H. Note 
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that sup(e) computed locally in H is the exact support of e globally 
in the current G, since both nb(u) and nb(v) are in H as u and v 
are internal vertices. Here in Steps 8-9, however, we only need to 
determine whether sup(e) = for an internal edge e in H. If 
sup(e) = 0, then by definition of truss e belongs to the 2-class 
$2. Thus, we output all internal edges in H with support as the 
2-class and also remove them from G to reduce the search space in 
the subsequent iterations. 

Then at Step 10, we output each remaining internal edge e of H, 
with lower bound (p(e). All these edges form a new graph, G new , 
which is stored as a list of edges on disk. 

At the end of each iteration, all internal edges of each neigh- 
borhood subgraph H are removed from G. At the next iteration 
when the shrunk graph G is re-partitioned, some of the external 
edges in the previous G at the previous iteration now become in- 
ternal edges, which are processed and finally also removed at the 
end of the current iteration. This process continues until the shrunk 
graph G can finally fit in main memory, in which case all edges are 
internal edges. 

5.2 Bottom-Up Truss Decomposition 

The second stage of the bottom-up approach is the process of 
bottom-up truss decomposition in the new graph G ne w obtained by 
Algorithm 3, where each edge e in G new is associated with a lower- 
bound on the truss number <p(e). We give the algorithm outline in 
Algorithm 4 and Procedure 5. 

Algorithm 4 starts from k = 3, extracts a candidate subgraph 
H for computing (Steps 3-5), compute from H (Step 6), 
and then moves on to (k + 1), until all edges in Gnew are removed 
(edges are removed at Step 6). 

The candidate subgraph H is extracted from G ne w as follows. 
We first scan G ne w once to obtain the set of candidate vertices Uk . 
Then, we scan G ne w again to extract all edges that have at least one 
end vertex in Uk, which gives H = NS(Uk). If Uk fits in memory, 
a single scan of G new suffices. 1 We also need to convert the set of 
edges extracted from G ne w into the adjacency list representation. 
If H fits in memory, the conversion is straightforward. 2 

The correctness of the candidate subgraph for computing will 
be proved in Theorem 2. We now discuss the main step in Algo- 
rithm 4, i.e., Step 6, which computes &k from the candidate sub- 
graph H by Procedure 5 as follows. 

Steps 1-5 of Procedure 5 are similar to Steps 2-11 of Algorithm 
2, except that Procedure 5 only computes &k and focuses on the 
internal edges of H. After outputting an edge e as a $^ edge, 
we remove e from H to reduce search space. The correctness and 
completeness of computed will be proved in Theorem 2. 

At the end of the procedure, we also remove all edges in 
from Gnew to reduce both search space and disk I/O cost for com- 
puting the remaining /c-classes. This step requires reading G new 
and re- writing the reduced G ne w back to disk. Checking whether 
an edge of G new is in can be done efficiently by keeping 
in a hashtable in memory. If cannot fit in memory, we need 
|<3>fc|/M scans of G new to remove all edges in from G new . 

*It is very rare that even Uk cannot fit in main memory of an ordi- 
nary PC today, for which case G new and hence the input graph G 
must be extra-ordinarily large. In this case, we need \ Uk\/M scans 
of Gnew to extract H. 

2 Otherwise, we can first write the edge list of H to disk when it is 
extracted from G new . Next, we read in H again and distribute the 
edges into different buckets on disk, according to the end vertices. 
Then, we read in each bucket, which fits in memory (otherwise we 
can use smaller buckets). And finally, we do the conversion for the 
edges in the bucket in memory and write the converted adjacency 
list to disk. 



Algorithm 4 Bottom-Up Truss Decomposition 

Input: G = (Vg, Eg) 

Output: the fc-class, <E>fc, for 2 < k < k max 

1. call LowerBounding (i.e., Algorithm 3); 

2. k <- 3; 

3. let U k = {v : v G V Gnew , 3e = (u, v) G E Gnew , s.t. <p(e) < k}; 

4. let H be the neighborhood subgraph NS(Uk) of Uk', 

5. scan G n ew to extract H; 

6. call Bottom-Up-Procedure(H, k); 

7. if (not all edges in G new are removed) 

8. ife<-(ife + l); 

9. goto Step 3; 



Procedure 5 Bottom-Up-Procedure(H, k) 

1. compute sup(e) for each internal edge e of H in memory; 

2. while(3 internal edge e = (u, v) of H s.t. sup(e) < k — 2) 

3. output e as a edge; 

4. for each triangle A uvw in H containing e do 

5. decrease the support of (u, w) and (v, w) by 1; 

6. remove e from H and G ne w', 



In the above discussion, we have assumed that H can fit in mem- 
ory, which is true in most cases. When the input graph is very large 
or the available memory size is small, H may not fit in memory. In 
this case, we need to scan H multiple times to compute $fc, in a 
similar way as in Algorithm 3. We partition H into p — 2\H\/M 
subgraphs and compute each subgraph in memory. The computa- 
tion in each subgraph in memory is the same as in Procedure 5. 
Since we focus on the internal edges of each subgraph only, itera- 
tively processing the above steps in memory finds all internal edges 
of H that are in The detailed algorithm description is omitted 
here for simplicity of presentation but is attached in Appendix. 

The following example illustrates how bottom-up truss decom- 
position is processed. 

Example 3. We illustrate the two stages of the bottom-up algo- 
rithm on our running example in Figure 2. In stage 1, Algorithm 
3 partitions G into three subgraphs Pi , P2 , P3 as shown in Figure 

3. Initially all the edges e of G have a lower bound ip(e) = 2. 
Let <& k (G') be the /c-class of graph G' . Given NS(Pi), Algo- 
rithm 2 returns $2 (A) = {(d,l), (g,l)}. All the remaining edges 
in NS(Pi) belong to $4 (A), and tp(e) for each such edge e is 
set to 4. In NS(P2), the local classes are computed as $2(^2) = 
{ (/, i) , (/, j) } and all the other edges in NS(P 2 ) belong to $3 (P2) ; 
the lower bound ip(e) of each edge e in <3>3(P2) is increased from 
2 to 3. P3 is processed in the same way. We add the internal edge 
(z, k) of NS(Ps) to $2 and remove it, and update the lower bounds 
of the 6 edges in the clique {/, i,j} to 4. At the end of stage 
1, all edges which have not been removed are written to disk as 
Gnew In stage 2, we compute the /c-classes for k > 3. To compute 
the 3-class, it is sufficient to load into main memory the subgraph 
NS(U3), which is shown in Figure 4(a). Procedure 5 computes 
the 3-class from this graph, giving $3 = {(d, g), (d, /), (g, /), 
(g,k),(h,g), (d,k), (e, /), (e, g), (/, g)}. We remove $3 from 
G n ew • We check that Gnew is not empty at Line 7 of Algorithm 

4, so continue with k = 4. NS(Ua) is loaded into memory. It 
is shown in Figure 4(b). $4 is determined in this step to be the 6 
edges in the clique {/, /i, z, j}. This is deleted from Gnew- The 
remaining graph will trigger another call of Procedure 5 at which 
point the 5-class will be determined. □ 
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(a) Pi (b) P 2 (c) P 3 

Figure 3: A partition V = {Pi, P 2 , P3} of G where Pi = 

{a, 6, c, /}, P 2 = {d, e, /, P 3 = {/i, i, j, k} 




(a) NS(U k ) for fc = 3 (b) NS(U k ) for fc = 4 



Figure 4: Subgraphs of relevant vertices for bottom-up steps 

5.3 Correctness and Complexity 

We first prove the correctness of Algorithm 4 as follows. 

THEOREM 2. Given a graph G, Algorithm 4 correctly com- 
putes in G,for 2 < k < k ma x- 

PROOF. First, $2 is correctly computed since Algorithm 3 com- 
putes the exact support (in G) of each internal edge of a subgraph 
H and we output all internal edges with support as $2 edges. All 
remaining edges in G must have support at least 1 and thus cannot 
be in $2. The remaining edges are written to disk as G new- 
Next, we prove each where 3 < k < kmax, is computed 
correctly. We first make an assumption, Assumption 1: the can- 
didate subgraph H contains all edges of as internal edges. If 
Assumption 1 is true, then clearly Procedure 5 (or Procedure 9 in 
Appendix if H cannot fit in memory) correctly computes since 
the procedures simply follow the definition of Thus, what re- 
mains to be proved is Assumption 1. 

Let us make another assumption, Assumption 2: G ne w contains 
all edges of &k at Step 3 of Algorithm 4 when H is to be extracted 
for At Steps 3-5 of Algorithm 4, we extract all edges e with 
(p(e) < k from G new as well as their neighboring edges that may 
form triangles with e. From Lemma 1, all edges with 0(e) = k 
will be extracted as internal edges. Thus, if Assumption 2 is true, 
then Assumption 1 must be also true. 

Now, we show that Assumption 2 is also true. When k — 3, 
Assumption 2 is clearly true since initially G new contains all edges 
in <£fc, for k > 3. Referring to the algorithm, G ne w is only modified 
when some is computed and all edges in <3>fc are removed at Step 
6 of Procedure 5 (or Step 12 of Procedure 9 in Appendix). Since all 
edges in are not used in the computation of for all j > k, 
removing all edges of &k from G new does not take away any edge 
in $j . Thus, Assumption 2 is also true and we have established the 
correctness of Algorithm 4. □ 

Next, we analyze the complexity of Algorithm 4. 

Theorem 3. Let G be the input graph. Let Pi be the set of 
candidate subgraphs extracted at Steps 3-5 of Algorithm 4. Let 
Pi' C Pi be the subset of candidate subgraphs that cannot fit in 
memory. Algorithm 4 computes &k in G,for 2 < k < k ma x, using 



0((§ + kmax)scan{\G\) + £ HeK , I A ^D I/0s and °( m1 ' 5 + 
T,Heu \ Ah \) CPU time. 

Proof. Algorithm 4 calls Algorithm 3 once, which has the same 
I/O complexity as that of I/O-efficient triangle listing algorithm 
[13], i.e., 0(§scan(\G\)) I/Os. If a candidate subgraph H fits in 
memory, extracting H at Steps 3-5 of Algorithm 4 and processing 
Procedure 5 require 0(scan(\G n ew |)) = 0(scan(\G\)) I/Os. For 
each H £ PL' ', where H cannot fit in memory, we take a heuristic 
approach, i.e., Procedure 9, which requires only a few scans of H 
for real graphs in practice. In the worst case, if the procedure does 
not terminate after c scans of PL for some constant c, we can simply 
take a naive approach by removing the edges with lowest support 
and the corresponding triangles one by one, which gives the worst 
case I/O complexity of O ( | A# | ) . 

Algorithm 3 uses 0(\ A G \) = Oim 1 ,5 ) CPU time in the worst 
case, since its complexity is the same as triangle listing [13]. For 
the computation of each we at most enumerate all triangles in 
the corresponding H, which gives 0(^2 Hen | Ah |) CPU time. □ 

In the worst case, \A H \ = OQEh] 1 ' 5 ). In practice, \A H \ is 
significantly smaller than \Eh\ 1,5 for real world sparse graphs. 

6. TOP-DOWN TRUSS DECOMPOSITION 

In many applications, one may not want all /c-trusses; instead, 
one may be interested in only the top-t /c-trusses for k > (kmax — 
t). Some applications may even be just interested in the top-1 k- 
truss, i.e., the kmax -truss, since it represents the very heart of a 
network/graph. For such applications, applying the bottom-up ap- 
proach can be wasteful. Therefore, we propose a top-down ap- 
proach as a solution. 

6.1 Algorithm Framework 

Similar to the bottom-up approach, the framework of the top- 
down approach also consists of two main parts. 

Upper-bounding: This part computes a nontrivial upper bound 
on the truss number of each edge. 

Top-down truss decomposition: This part computes the top-t k- 
classes iteratively top-down, i.e., from the k max -class down 
to the (k max — t ~h 1) -class. We use the upper bound of the 
truss number to extract a small candidate subgraph to com- 
pute a /c-class. After obtaining a /c-class, we remove those 
edges that do not affect the computation of any k' -classes, 
for k' < k, in order to enhance the I/O and CPU perfor- 
mance for computing the /^-classes. 

Note that the above framework is similar to the top-down core 
decomposition framework [9]. However, the definition of /c-truss 
is based on triangles instead of simple vertex degree as in /c-core. 
Thus, the detailed operations in the algorithm are considerably dif- 
ferent. In particular, the top-down approach was shown to be more 
effective than the bottom-up approach for core decomposition [9], 
but is much less efficient for truss decomposition. This is mainly 
because the computed edges cannot be effectively removed (unlike 
the computed vertices in core decomposition) due to the more in- 
tricate definition of /c-truss. Therefore, the top-down approach is 
only effective for computing the top-t /c-classes, for some reason- 
ably small t. 

6.2 Upper Bounding 

To avoid operating on the entire input graph, the top-down ap- 
proach first obtains a candidate subgraph for computing the /c-class 
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Procedure 6 UpperBounding(G ne w) 

1. partition V Gnew into P = {Pi, P 2 , . . . , P P }, 
s.t. each NS(Pi) fits in memory; 

2. for each Pi e V do 

3. let if be the neighborhood subgraph NS(Pi) of P$; 

4. for each internal edge e = (it, v) of if do 

5. let x w (or x v ) be the maximum value of x s.t. 

there are x edges incident to it (or v), 
excluding (it, v), with support at least x; 

6. 0(e) ^ (ram{sitp(e), x u , x v } + 2); 



for a given fc. To extract a candidate subgraph for this purpose, we 
need to determine an upper bound of the truss number of the edges 
in the input graph G. 

We outline the algorithm for computing the upper bound in Pro- 
cedure 6. The input to Procedure 6 is a graph, G new , where each 
edge e in G new is associated with sup(e) (computed at Step 1 of 
Algorithm 7). The algorithm partitions G new intop > 2\G ne w\/M 
neighborhood subgraphs [13], so that each neighborhood subgraph 
if fits in memory. Then, for each if, the upper bound of the truss 
number of an internal edge e = (it, v) of if, denoted by -0(e), is 
computed as follows. 

Let w be an end vertex of the edge e = (it, v), i.e., w = u or 
w = v. We compute the maximum value of x w such that there 
are x w edges incident to w, excluding e, with support at least x w . 
Then, we set^(e) = (min{sup(e), x u , x v } + 2). 

Example 4. Consider Figure 2. For each edge e in the 5-class, 
ip(e) = 5. Next consider (d,g). The support sup((d,g)) is 3. 
Xd — 3 since there are 3 edges other than (d,g) incident to d with 
support 3. However x g = 2 since there are only 2 edges other than 
(d, g) with support at least 2. Hence ip((d, g)) — 2 + 2 = 4. 

The following lemma shows that ip(e) is an upper bound of 0(e). 

Lemma 2. Given a graph G and any neighborhood subgraph 
if of G, we have 0(e) < ip(e), where e — (u,v) is an internal 
edge of H and ip(e) is computed in if. 

PROOF. First, since e is an internal edge of if, all edges incident 
on it or v are also present in if. Suppose to the contrary that 0(e) > 
ip(e). Then, by the definition of /c-truss, e is contained in more 
than (ip(e) — 2) triangles, which also means that there are more 
than (ip(e) — 2) edges incident on both it and v with support more 
than (i/j(e) — 2). However, this implies that sup(e) > 00(e) — 2), 
x u > 00(e) — 2) and x v > 00(e) — 2), which contradicts the fact 
that -0(e) = (min{sup(e)i x u , x v } + 2). □ 

6.3 Enumerating Top-t Truss Classes 

The second part of the top-down approach mainly concerns with 
the computation of the top-t /c-classes, from kmax down to (kmax — 
t + 1), for a given t, as outlined in Algorithm 7. 

The first step in Algorithm 7 calls Algorithm 3. However, since 
the top-down approach has no need of the lower-bound on the truss 
number, <p(e), of each edge e, we do not compute ip(e) in Algo- 
rithm 3. Instead, we require sup(e) for computing an upper-bound 
on the truss number, as discussed in Section 6.2. That is, Steps 6-7 
of Algorithm 3 are not processed and we replace ip(e) by sup(e). 
However, Algorithm 3 still removes all the edges that belong to $2 
to reduce the search space for the later stage of top-down truss de- 
composition. Note that removing $2 does not affect the support of 
any other edge that is part of some triangle. 

The top-down computation then starts from the largest possible 
k based on the upper-bounds of the truss numbers, extracts a candi- 
date subgraph if for computing (Steps 4-6), compute from 



Algorithm 7 Top-Down Truss Decomposition 
Input: G = (V G , Eg) 

Output: the top-t /c-class, for kmax > k > (kmax — t) 

1. call Algorithm 3, but computing sup(e) instead of y?(e); 

2. call UpperBounding(G new ); 

3. k^ max{^(e) : e G E Gnew }; 

4. let U k = {v:ve V Gnew , 3e = (u, v) G E Gnew , s.t. ^(e) > k 
and Mi > k: e 

5. let if be the neighborhood subgraph NS(Uk) ofU^; 

6. scan G new to extract H; 

7. call Top-Down-Procedure(H ', k); 

8. k^(k-l); 

9. repeat Steps 4-8 until the top-t k -classes are computed 
or G new becomes empty; 



Procedure 8 Top-Down-Procedure(H ', k) 

1. compute sup(e) for each internal edge e of H in memory; 

2. while(3 internal edge e = (u, v) of H s.t. sup(e) < k — 2) 

3. for each triangle A uvw in H containing e do 

4. decrease the support of (u, w) and (v, w) by 1; 

5. remove e from if; 

6. remove any edge e G Tj (j > fc) from H and 

output all remaining internal edges of H as ; 

7. for each edge e = (it, v) G ^>j, where e in G new and j > k, do 

8. if (for every triangle A uvw in G n ew'- 3il, i2 > k s.t. 

(it, iu) G <E>a and (v, iu) G ^2) 

9. remove e from G new ; 



if (Step 7), and then moves on to (k — 1), until all the top-t k- 
classes are computed, or G ne w becomes empty (in which case all 
the /c-classes, for 2 < k < kmax, are computed). 

The candidate subgraph if is extracted from G new in the same 
way as Steps 3-5 of Algorithm 4, except that Uk is computed based 
on -0(e) instead of ip(e). However, the first value of k (let it be 
kist) computed by Step 3 of Algorithm 7 may be much greater 
than the true kmax, in which case we will need to repeat Steps 4- 
8 for (kist — kmax) times before the first non-empty /c-class, i.e., 
$fc mM , is computed. To avoid this, we may use the smallest k (let 
it be kinit) such that the corresponding candidate subgraph if fits in 
memory. We may then simply apply the in-memory truss decom- 
position algorithm to compute all /c-classes, for k in the range from 
kinit to kist. Then, we start the top-down process from (kinit — 1). 

Step 7 of Algorithm 7 calls Procedure 8 to compute from the 
candidate subgraph if. The algorithm is similar to the bottom-up 
procedure given in Procedure 5, except for the following differ- 
ences. First, we iteratively remove all internal edges of if with 
support less than (k — 2), and then output the remaining internal 
edges as Second, we remove an edge e from G ne w only if it 
is no longer involved in any triangle that contains an edge whose 
truss number is yet to be computed. 

Procedure 8 assumes that if can fit in main memory. We process 
the case that if cannot fit in main memory in a similar way as we 
do for the same case in the bottom-up computation, as discussed in 
Section 5.2. A similar version of Procedure 8 for handling the case 
that if cannot fit in memory is given in the Appendix. 

The following example illustrates how top-down truss decompo- 
sition is processed. 

Example 5. Consider top-down truss decomposition for our run- 
ning example in Figure 2. After Step 1 of Algorithm 7, Gnew con- 
tains all edges in G except for (i,k). After calling Procedure 6, 
UpperBounding(Gnew), we get all the -0(e) values for edges e in 
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(a) U 5 (b) U 4 



Figure 5: Top-Do wn Truss Decomposition for t — 2 

Gnew, and k is set to 5 in Step 4 of Algorithm 7. The induced 
subgraph of U5 in G new is shown in Figure 5(a). H = NS(U$) 
contains also vertices f,g,l,k. Next we call Procedure 8 to ex- 
tract $5, which is determined to be the graph in Figure 5(a). We 
can remove all of these edges except for (e, d), which is involved 
in Aedg and the truss numbers of (e, g) and (d, g) are not > 5. 
Next we set k = 4, and repeat Steps 4-8 of Algorithm 7. The in- 
duced subgraph of U4 in G new is shown in Figure 5(b). Note that 
the upper bounds ip(e) for all edges here are 4 except for (e,d), 
ip{{e, d)) = 5. H = NSQJ4) also contains vertices I and k. Next 
we run Procedure 8 to compute $4, which is the set of edges in the 
clique {f,h,i,j}. The 6 edges involved are then removed from 
Gnew except for (f,h). Since t = 2, the algorithm terminates 
here. 

6.4 Correctness and Complexity 

We first prove the correctness of Algorithm 7 as follows. 

THEOREM 4. Given a graph G, Algorithm 7 correctly com- 
putes the top-t <J>fc in G,for k m ax > k > (kmax - t). 

PROOF. In order to prove the correctness of Algorithm 7, we 
prove that Procedure 8 (or Procedure 10 in the Appendix) correctly 
computes $fc, we need to show that the following statement holds, 
Statement T. the candidate subgraph H contains all edges of &k 
as internal edges and all edges that form triangles with any edge 
in let us refer to this set of edges as If Statement 1 is true, 
then Procedure 8 (10) returns since these procedures simply 
operate by following the definition of Statement 1 is true if 
Gnew contains <I>^ because H is the neighborhood subgraph of 
all edges e in G ne w with unknown truss number where cp(e) > 
k. From Lemma 2, this set of edges includes all edges of if 
Gnew contains <I>^ at the time H is to be extracted at Steps 4-6 of 
Algorithm 7. Next we prove Statement 2: G new contains <£^. 

When k > k max , apparently G new contains since no edge 
has been removed from G new yet. According to Steps 7-9 (or Steps 
15-17 of Procedure 10 in Appendix), edges are removed from Gnew 
only if they are no longer involved in any triangle that contains an 
edge whose truss number is yet to be computed. Thus, by induction 
on k for Statement 2 and the correctness of Algorithm 7, with a 
base case of k = kmax, G ne w always contains <I>^ when H is 
to be extracted for the enumeration of the /c-class and the theorem 
follows. □ 

Next, we analyze the complexity of Algorithm 7. 

Theorem 5. Let G be the input graph. Let 1~L be the set of 
candidate subgraphs extracted at Steps 4-6 of Algorithm 7. Let 
7-L' C 7-L be the subset of candidate subgraphs that cannot fit in 
memory. Algorithm 7 computes the top-t &k in G, for kmax > 
k > (k max t)> using 0((^f H - t) sca,n(\G\) + X^i/gH 7 I^^H) 
I/Os andO(m 15 + Y^h^u \ Ah \) CPU time. 

Proof. The analysis is similar to that of Theorem 3. □ 



7. EXPERIMENTAL EVALUATION 

We compare the performance of our algorithms with the existing 
in-memory and MapReduce algorithms [15, 16]. All the sequen- 
tial algorithms were tested on a machine with the Intel Core2 Duo 
2.80GHz CPU, 4GB RAM, and the Ubuntu 11.04 operating sys- 
tem. The MapReduce algorithm [16] was ran on an Amazon Elas- 
tic MapReduce cluster with 20 nodes, each of which has the com- 
puting capacity of a 1.0 GHz 2007 Xeon processor, 1.7GB RAM, 
and 160GB instance storage; the Hadoop (version 0.20.205) im- 
plementation of MapReduce is deployed and the default setting is 
assumed. 

Datasets. We use the following nine datasets: Gnutella Internet 
peer-to-peer network (P2P), High Energy Physics collaboration 
network (HEP), Amazon product co-purchasing network (Amazon), 
Wikipedia Talk network (Wiki), Autonomous systems by Skitter ( Skit- 
ter), Live Journal (LJ), Blogs (Blog), Billion Triple Challenge (BTC), 
and World Wide Web of UK (Web). The first six are from the Stan- 
ford Network Analysis Project (snap.stanford.edu). P2P represents 
hosts as vertices and the connections between the hosts as edges. 
HEP represents each paper as a vertex and each citation between 
two papers as an edge. Amazon represents products as vertices and 
edges exist between frequently co-purchased products. Wiki repre- 
sents Wikipedia users as vertices and an edge indicates that a user 
once edited a talk page of another user. Skitter describes an internet 
topology constructed from several sources to about a million des- 
tinations. LJ is from the free online community Live Journal (live- 
journal.com), which represents members as vertices and friend hip 
as edges. Blog is from the blogs network and has vertices as blogs 
and an edge indicates that two blogs appear in the same search re- 
sult of the top- 15 queries published by Technorati (technorati.com). 
BTC is an RDF graph cosntructed from the Billion Triple Chal- 
lenge (vmlion25.dei.de). Web is from the Yahoo! webspam dataset 
(barcelona.research.yahoo.net) in which webpages are represented 
as vertices and hyperlinks as edges. Some statistics of the datasets 
are shown in Table 2. Note that the median degree of most datasets 
is rather small due to the heavy tail of power-law degree distribu- 
tion observed in these graphs. 

Table 2: Statistics of datasets (K = 10 3 , M = 10 6 , G = 10 9 ): 
the number of vertices and edges (\Vg\ and |-Eg|)» disk storage 
size (in bytes), maximum and median degree (d ma x and d me d\ 
and the largest k for any /c-truss {kmax) 





\Vg\ 


\E G \ 


size 


dmax 


dmed 


kmax 


P2P 


6.3K 


41. 6K 


237K 


97 


3 


5 


HEP 


9.9K 


52.0K 


317K 


65 


3 


32 


Amazon 


0.4M 


3.4M 


47.9M 


2752 


10 


11 


Wiki 


2.4M 


5.0M 


66.5M 


100029 


1 


53 


Skitter 


1.7M 


11. 0M 


149.1M 


35455 


5 


68 


Blog 


1.0M 


12.8M 


177.2M 


6154 


2 


49 


LJ 


4.8M 


69M 


809. 1M 


20333 


5 


362 


BTC 


165M 


773M 


10.0G 


1637619 


1 


7 


Web 


106M 


1092M 


12.2G 


36484 


2 


166 



7. 1 Performance of In-Memory TD Algorithms 

We first assess the efficiency of the improved in-memory algo- 
rithm (i.e., Algorithm 2), denoted by TD-inmem+ (TD for Truss 
Decomposition), compared with the in-memory algorithm [15], de- 
noted by TD-inmem. 

Table 3 reports the results on the following four datasets, Wiki, 
Amazon, Skitter, and Blog, which can fit in memory. For the 
other larger datasets, LJ, BTC, and Web, both algorithms did not 
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complete within reasonable time (longer than a week) due to large 
amount of memory usage (over 4GB memory). 



Table 3: Running time (wall-clock time in seconds) and peak 
memory usage (bytes) of TD-inmem+ and TD-inmem 





Wiki 


Amazon 


Skitter 


Blog 


Time (TD-inmem) 


8856 


68 


9204 


1261 


Time (TD-inmem+) 


121 


31 


281 


361 


Speedup ratio 


73.2 


2.2 


32.8 


3.5 


Mem-usage (TD-inmem) 


966M 


295M 


1.4G 


715M 


Mem-usage (TD-inmem+) 


846M 


398M 


1.6G 


1.1G 



The results show that TD-inmem+ is apparently much faster than 
TD-inmem for all datasets. The speedup is from 2.2 times up to 
73.2 times faster, with comparable memory consumption. The re- 
sults suggest that our improved in-memory algorithm not only at- 
tains a lower theoretical time complexity, but also significantly im- 
proves the efficiency in practice. 

7.2 Performance of Bottom-Up TD Algorithm 

We evaluate the performance of the I/O-efficient bottom-up algo- 
rithm, denoted by TD-bottomup, compared with Cohen's MapRe- 
duce algorithm [16] that also does not require to keep the entire 
input graph in main memory, denoted by TD-MR. 

Table 4 reports the running time of TD-bottomup and TD-MR. 
Note that when the input graph can fit in memory, our 1/O-efficient 
algorithms are simply the in-memory algorithm, TD-inmem+. Thus, 
we skipped the smaller datasets (which are shown in Table 3) and 
ran TD-bottomup on the larger networks, LJ, BTC, and Web. 

However, we were not able to obtain the result on these large 
datasets for TD-MR because it is at least more than 3 orders of 
magnitude slower than our algorithm. In fact, we were only able 
to obtain the results for TD-MR on the two smallest datasets, HEP 
and P2P, as shown in Table 4. 

Table 4: Running time (wall-clock time in seconds) of TD- 
bot tomup and TD-MR 





P2P 


HEP 


LJ 


BTC 


Web 


Time (TD-bottomup) 


<1 


<1 


664 


1768 


6314 


Time (TD-MR) 


4200 


14760 









The results show that TD-bottomup takes less than 1 second to 
perform truss decomposition in HEP and P2P, but TD-MR takes 
4200 and 14760 seconds on 20 machines. The drastically worse 
performance of TD-MR is because MapReduce is not a suitable 
framework for the task of truss decomposition. On the contrary, 
TD-bottomup uses a single machine and takes less or similar amount 
of time on datasets with size over 30000 times larger than HEP. The 
results thus verify that our I/O-efficient algorithm is more feasible 
for truss decomposition in massive networks. 

7.3 Performance of Top-Down TD Algorithm 

For the top-down algorithm, denoted by TD-topdown, we focus 
on the three large graphs, LJ, BTC, and Web. 

We used TD-topdown to compute both the top-20 /c-classes and 
all the /c-classes. Table 5 reports the running time of TD-topdown, 
where we also show the running time of TD-bottomup as a refer- 
ence. 

The results show that for LJ and Web, the top-down approach 
has a significant benefit in computing the top-20 results than the 
bottom-up approach. For the BTC dataset, since 

kmax ^ 20, TD- 

topdown computes all the /c-classes and has a comparable speed 



Table 5: Running time (wall-clock time in seconds) TD- 
topdown and TD-bottomup 





LJ 


BTC 


Web 


TD-topdown (top-20) 


149 


1744 


2354 


TD-topdown 


941 


1744 




TD-bottomup 


664 


1768 


6314 



to that of TD-bottomup. However, TD-topdown is about 6.3 times 
slower than TD-bottomup for computing all the /c-classes, and it 
cannot finish within reasonable time for the largest dataset, i.e., 
Web. The results thus show that TD-topdown is suitable for com- 
puting the top-t results for some small t or for processing datasets 
that have a small k max . 

7.4 K-Truss vs. K-Core 

In this experiment, we show that /c-truss is better than /c-core 
as a type of cohesive subgraphs. We compute the /c-core that has 
the maximum core number, i.e., the non-empty /c-core that has the 
largest value of k. We use c max to denote the maximum core num- 
ber, in order to distinguish it from the maximum truss number (i.e., 
since c max / k m ax in general. We denote the k m ax -truss 
by T and the c ma x-core by C in this experiment. 

Table 6: Statistics of the kmax -truss, T, and the c maa ;-core, C, 
of various datasets (K = 10 3 ): the number of vertices (Vt/Vc), 
the number of edges (E T /Ec)<> the maximum truss/core num- 
ber {kmaxlc-max ), and the clustering coefficient (CCt/CCc) of 
T and C, respectively 





Vt/Vc 




k-maxl Cmax 


CCtICCc 


Amazon 


5K/33K 


55K/442K 


11/10 


0.99/0.72 


Wiki 


237/700 


32K/147K 


53/131 


0.64/0.42 


Skitter 


185/222 


16K/33K 


68/111 


0.95/0.71 


Blog 


49/387 


2K/54K 


49/86 


1.00/0.52 


LJ 


383/395 


146K/155K 


362/372 


1.00/0.99 


BTC 


653/1295 


10K/838K 


7/641 


0.45/0.00002 


Web 


498/862 


82K/148K 


166/165 


1.00/0.59 



Table 6 reports some statistics of T and C. The results show that 
the size of T, in terms of both the number of vertices and edges, 
is significantly smaller than that of C, indicating that the "core" of 
a network represented by the kmax -truss (i.e., T) and that by the 
Cmax -core (i.e., C) are radically different. 

From another angle, the result that k ma x is much smaller than 
Cmax for most datasets implies that although the c ma x-core is a sub- 
graph in which vertices have a large number of edge-connections to 
each other, most of these connections do not really form tightly-knit 
clusters or communities as do in the kmax -truss. This is verified by 
the clustering coefficient [33] of T and C, i.e., CCt and CCc 
in Table 6. The result clearly indicates that the kmax -truss and its 
vertices tend to form clusters much more likely than the c macc -core. 

It is also worth noting that kmax and c ma x are differed by only 
1 for Amazon and Web. Since the /c-truss is a (k — l)-core but 
not vice versa, e.g., the 11-truss of Amazon is a 10-core or more 
precisely a subgraph of the 10-core, the result that the 11-truss is 
so much smaller than the 10-core reveals that out of a large part of 
a network seemingly to be well-connected, it is possible that only a 
small portion is truly tightly connected. 

The /c-core can be used as an effective heuristic for maximal 
clique enumeration [17], since a clique of size k must be in a 
(k — l)-core, which can be significantly smaller than the original 
graph. However, our result shows that the /c-truss can be a much 
better candidate since its size is in general much smaller than the 
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/c-core. Note that triangles are fundamental units in a clique and a 
clique of size k must be in a /c-truss. 

We also know that the size of the maximum clique is bounded 
by (cmax + 1) and kmax- Thus, our result shows that k ma x gives a 
much lower upper bound on the size of the maximum clique. For 
example, we know that the maximum clique in the Wiki graph has 
at most 53 vertices as bounded by kmax, instead of 132 vertices as 
bounded by (c m ax + 1). 

In conclusion, the results demonstrate that the "core" of a net- 
work represented by the k ma x -truss (i.e., T) is significantly more 
cohesive or tightly-knit than that by the c ma x-core (i.e., C). The 
results verify that triangle-based connection (as in /c-truss) is more 
robust than edge-based connection (as in /c-core). As cohesive sub- 
graphs are useful for studying important properties (e.g., connec- 
tivity, robustness, centrality, etc.) of a network, /c-truss is a better 
candidate than fc-core for network analysis and related tasks. In 
addition, we also show that /c-truss can be employed as a more 
effective heuristic for maximal clique enumeration and maximum 
clique finding. 

8. RELATED WORK 

The most closely related works to /c-truss are the cohesive sub- 
graphs [18], such as clique [21, 7], n-clique [22], /c-plex [29], n- 
clan [24], n-club [24], various types of quasi-cliques [23, 1], and 
/c-core [28], which we have discussed in Section 1. 

In addition to the cohesive subgraphs, /c-truss is also related to 
dense subgraphs, in particular, the DN-graph [31], which is a con- 
nected subgraph in which the lower bound on the number of tri- 
angles of edges is locally maximized. Their definition renders the 
problem NP-hard and their solution is approximate. 

The only existing algorithms for truss decomposition were pro- 
posed by Cohen [15, 16]. Their first algorithm is an in-memory 
algorithm [15], which is slow for handling large real- world graphs 
with power-law distribution. Our in-memory algorithm improves 
their algorithm by removing the bottleneck for processing vertices 
with high degree. Their second algorithm is a MapReduce algo- 
rithm [16], which is actually not suitable for the task of truss de- 
composition due to the iterative process that blocks parallelization. 
We verified in our experiments that this MapReduce algorithm is 
inefficient and cannot handle large graphs. On the contrary, our 
I/O-efficient algorithm handles large graphs efficiently. 

The framework of our top-down algorithm is similar to that of 
the algorithm for core decomposition [9], but the detailed design of 
our algorithms is totally different from theirs and also more com- 
plicated than theirs. In particular, the strong triangular connection 
within the /c-truss does not allow effective pruning in top-down 
truss decomposition as in top-down core decomposition [9]. As 
a solution for finding /c-truss for all k, this paper proposes a more 
effective bottom-up approach. In direct contrast, the bottom-up ap- 
proach is not suitable for core decomposition in large graphs [9], 
since the /c-cores for small k are generally too large. 

Truss decomposition is different from triangle counting [27, 20] 
since triangle counting is only one step invoked in the iterative pro- 
cess of A; -truss computation, and any efficient triangle counting al- 
gorithm can be applied. In particular, we apply the I/O-efficient al- 
gorithms [14, 13] to compute the support of edges. However, ensur- 
ing I/O-efficient support counting is only one part of our algorithms 
and we still have to ensure all the other steps in truss decomposition 
also I/O-efficient. For example, the bottom-up/top-down steps for 
iterative truss decomposition. 

Other related work includes 1/O-efficient maximal clique enu- 
meration [10, 11, 12]. Their algorithms cannot be extended to 
compute /c-truss. In fact, both the graph partitioning step and the 



subgraphs being extracted out of the partition in their algorithms 
are different from our algorithms. 

9. CONCLUSIONS 

We proposed efficient algorithms for truss decomposition: an in- 
memory for fast truss decomposition in networks of moderate size, 
a bottom-up I/O-efficient algorithm for massive networks that are 
too large to fit in memory, and a top-down algorithm tailored for 
applications that prefer the top-t /c-trusses. We verified by experi- 
ments on a range of real datasets that our algorithms significantly 
outperform the existing in-memory algorithm [15] and the MapRe- 
duce algorithm [16] on both small and large networks. We also 
showed that /c-truss is more suitable than /c-core as a cohesive sub- 
graph and it reveals tightly-knit clusters of a network. 
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APPENDIX 

Procedure 9 is called at Step 6 of Algorithm 4 in Section 5.2, to 
compute &k in the candidate subgraph H for the case when H 
cannot fit in memory. The framework of Procedure 9 is similar 
to Algorithm 3, while the computation of the &k edges at Steps 
7-11 of Procedure 9 is similar to that at Steps 1-5 of Procedure 
5. The detailed explanation is thus omitted here but can be found 
in Sections 5.1 and 5.2 where Algorithm 3 and Procedure 5 are 
discussed. 



Procedure 9 Bottom-Up-Procedure-2(H, k) 

1. sup(e) <— for all internal edges of H; 

2. H' <- H; 

3. while (not all edges in H ' are removed) 



4. partition V H > into V = {Pi , P2 , • . . , P P }, 

s.t. each Pi G V fits in memory; 

5. for each Pi G V do 

6. let F be the neighborhood subgraph NS(Pi) of Pi\ 

7. compute sup(e) for each internal edge e of F; 

8. while (3 internal edge e = (u, v) of F, 

s.t. sup(e) < (k - 2) ) 

9. output e as a edge; 

10. for each triangle A uvw in F containing e do 

11. decrease the support of (u, w) and (v, w) by 1; 

12. remove e from F, H, and G new ; 

13. remove all internal edges of F from H 



14. repeat Steps 2-13 until all remaining internal edges of H 
have support greater than (k — 2); 



Procedure 10 is called at Step 7 of Algorithm 7 in Section 6.3, 
to compute in the candidate subgraph H for the case when H 
cannot fit in memory. The framework of Procedure 10 is similar 
to Procedure 9, while the other computation details are the same as 
Procedure 8. 



Procedure 10 Top-Down-Procedure-2(H ', k) 

1. sup(e) <— for all internal edges of H; 

2. H' «— H; 

3. while (not all edges in H ' are removed) 



4. partition V H , into V = {Pi , Pi , . . . , P p }, 

s.t. each Pi G V fits in memory; 

5. for each Pi G V do 

6. let F be the neighborhood subgraph NS(Pi) of Pi\ 

7. compute sup(e) for each internal edge e in F; 

8. while (3 internal edge e = (u, v) of F, 

s.t. sup(e) < (k - 2) ) 

9. for each triangle A uvw in F containing e do 

10. decrease the support of (u, w) and (v, w) by 1; 

1 1 . remove e from F and H; 

12. remove all internal edges of F from H'\ 



13. repeat Steps 2-12 until all remaining internal edges of H 
have support at least (k — 2); 

14. remove any edge e G Tj (j > k) from H and 

output all remaining internal edges of H as ^ ; 

15. for each edge e = (u, v) G <E>j, where e in G new and j > k, do 

16. if(for every triangle A UV w in G new : 3il, i2 > k s.t. 

(u, w) G <E>a and (v, w) G ^2) 

17. remove e from G new \ 
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